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Abstract  -  This  paper  puts  forward  a  formal  basis  for  behavior-based  robotics,  using 
techniques  that  have  been  successful  in  control-theory-based  approaches  for  steering  and 
stabilizing  robots  that  are  subject  to  nonholonomic  constraints.  In  particular,  behaviors 
for  robots  are  formalized  in  terms  of  kinetic  state  machines,  a  motion  description  lan¬ 
guage,  and  the  interaction  of  the  kinetic  state  machine  with  real-time  information  from 
(limited  range)  sensors.  This  formalization  allows  us  to  create  a  mathematical  basis  for 
the  study  of  such  systems,  including  techniques  for  integrating  sets  of  behaviors.  In  addi¬ 
tion  we  suggest  optimality  criteria  for  comparing  both  atomic  and  compound  behaviors 
in  various  environments.  A  hybrid  architecture  for  the  implementation  of  path  planners 

‘This  research  was  supported  in  parts  by  grants  from  the  National  Science  Foundation’s  Engineering 
Research  Centers  Program:  NSFD  CDR  8803012,  the  AFOSR  University  Research  Initiative  Program, 
under  grant  AFOSR-90-0105,  and  AFOSR-F49620-92-J-0500,  from  NSF(IRI-9306580),  ONR  (N00014- 
J-91-1451),  AFOSR  (F49620-93-1-0065),  the  ARPA/Rome  Laboratory  Planning  Initiative  (F30602-93- 
C-0039  and  by  ARI  (MDA-903-92-R-0035,  subcontract  through  Microelectronics  and  Design,  Inc.) 
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that  uses  the  motion  description  language  is  also  presented. 


1  Introduction 

Traditional  robot  motion  planning  and  obstacle  avoidance  concentrates  on  determining 
a  path  in  the  presence  of  holonomic  or  integrable,  equality  and  inequality  constraints  on 
the  configuration  space.  In  practice  however  many  robotic  systems  include  constraints 
that  are  not  holonomic.  Such  kinematic  constraints  cannot  be  reduced  to  equivalent 
constraints  on  the  configuration  variables  which  are  explicit  functions  of  position  variables 
(e.g.  a  front  wheel  drive  car,  object  manipulation  by  rolling  with  a  robotic  hand). 

Often  such  drift-free  (completely)  nonholonomic  systems,  where  the  number  of  controls 
is  less  than  the  number  of  states,  are  controllable.  Papers  [1,  2,  3,  4]  present  analytical 
tools  based  on  Lie  algebras  to  generate  control  sequences  to  steer  such  systems.  The 
use  of  sinusoids  in  such  problems  is  already  anticipated  in  the  work  of  Brockett  on 
Singular  Riemannian  Geometry  [5].  As  these  nonholonomic,  drift-free  systems  do  not 
satisfy  Brockett’s  necessary  condition  for  smooth  stabilization  [6],  these  systems  cannot 
be  stabilized  to  the  origin  in  state  space  by  using  smooth  time-invariant  state  feedback. 
This  reinforces  the  need  for  alternatives  such  as  piecewise  smooth  feedback  controllers  [7], 
time-varying  periodic  controllers  [8]  and  explicit  control  design  to  generate  time-varying 
stabilizable  control  laws  [9]. 

While  a  majority  of  the  above  research  on  steering  and  stabilization  of  nonholonomic 
systems  [10]  assumes  an  obstacle-free  world,  we  note  that  the  problem  of  autonomous 
path  planning  and  obstacle  avoidance  with  nonholonomic  robots  is  a  nontrivial  one. 
Modeling  obstacles  as  constraints  in  the  configuration  space  and  then  designing  control 
laws  is  a  complex  problem.  In  addition  deriving  control  laws  for  limited  range  sensors 
with  imperfect  and  uncertain  information  poses  additional  problems. 

In  the  AI  literature  there  is  growing  interest  in  using  reactive,  sensor-based  behaviors 
to  solve  path  planning  problems  given  little  or  no  a  priori  information,  inspired  by  the 
work  of  Rodney  Brooks  [11].  The  idea  here  is  to  rely  on  the  direct  coupling  of  sensory 
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information  and  actuators.  This  approach  in  contrast  with  methods  based  on  control 
theory,  has  resisted  mathematical  formalization  and  is  not  amenable  to  tests  for  optimal¬ 
ity.  Arguably,  a  better  understanding  of  the  properties  of  nonholonomic  systems,  would 
enable  one  to  exploit  the  underlying  geometry  along  with  real-time  sensor  information 
for  path  planning  and  obstacle  avoidance.  Hence  there  is  a  need  for  a  framework  that  can 
capture  and  integrate  features  of  both  modern  control-theoretic  techniques  and  reactive 
planning  methods. 

In  this  paper,  we  introduce  a  motion  description  language  which  we  call  MDLe  (to 
denote  its  relationship  as  an  extension  of  Brockett’s  motion  description  language,  MDL, 
[12,  13]),  that  can  encode  and  integrate  aspects  of  modern  control  theory  approaches 
to  steering  nonholonomic  robots  with  those  of  reactive-planning  systems  that  rely  on 
the  direct  coupling  of  sensory  information  and  actuators.  This  is  done  by  introducing 
sensor-driven  trigger  functions  into  MDL  atoms.  This  motion  description  language  also 
gives  us  the  means  to  formalize  concepts  such  as  “behavior”,  “plan”  etc.  used  in  the 
path  planning  literature.  We  also  introduce  a  hybrid  architecture  for  path  planning  and 
obstacle  avoidance  that  utilizes  the  formalism  of  this  paper.  For  other  related  work 
inspired  by  Brockett’s  MDL,  see  [14]. 

In  section  2  we  present  details  of  MDLe,  examples  of  obstacle  avoidance  problems 
modeled  in  the  framework  of  the  language  and  suggest  some  optimality  criteria.  The 
hybrid  architecture  is  discussed  in  section  3.  A  brief  outline  of  a  planner  that  generates 
“behaviors”  for  path  planning  and  obstacle  avoidance  is  also  given.  Section  4  includes 
final  remarks  and  future  research  directions. 


2  Language  for  Motion  Planning 

We  treat  a  nonholonomic  robot  as  a  kinetic  state  machine  (following  Brockett  [12])  which 
can  be  thought  of  as  a  continuous  analog  of  a  finite  automaton.  In  the  framework  of 
MDLe  these  kinetic  state  machines  are  governed  by  differential  equations  of  the  form 


3 


(1) 


m 

X  =  h{x)ui\  y  =  h(x)  €  Rp 
2—1 

where 

rr(-)  :  R+  =  [0,  oo)  — »  Rn 
Ui  :  R+  xRp  4  R 

(t,y(t))  «i(2,|/(2)) 

Further  each  b{  is  a  vector  field  in  R”. 

We  now  define  the  atoms  of  the  motion  language  as  triples  of  the  form  (U,  £,  T)  where 

U  —  (itj ,  •  •  um) 


where  tq  is  as  defined  earlier, 

£:  Rfc  -4  {0,1} 

s(t)  M-  £(s(f)) 

is  a  boolean  function,  T  €  R+  and  s(-)  :  [0,T]  ->  Rfc  is  a  k  dimensional  signal  that 
represents  the  output  of  the  k  sensors.  £  can  be  interpreted  as  an  interrupt  or  trigger  to 
the  system  which  is  activated  in  a  case  of  emergency,  e.g.  the  robot  gets  too  close  to  an 
obstacle.  Let  us  denote  by  T,  (measured  with  respect  to  the  initiation  of  the  atom)  the 
time  at  which  an  interrupt  was  received  i.e.  f  changes  state  from  1  to  0.  The  definition 
of  an  atom  here  can  be  compared  with  that  in  MDL  [12]  where  Brockett  seems  to  treat 
time-outs  in  T,  instead  of  giving  explicit  status  to  triggers. 

If  at  time  t0  the  kinetic  state  machine  receives  an  input  atom  (17,  f ,  T)  the  state  will 
evolve  governed  by  the  differential  equation  (1),  as 

x  =  B(x)U,  V  t,  t0  <  t  <  t0  +  min[T,T]. 


If  the  kinetic  state  machine  receives  an  input  string  (17i,£i,Ti)  •  •  •  (Un,  £n,  Tn)  then  the 
state  x  will  evolve  according  to 

x  =  B(x)Ui,  t0<t<  t0  +  min[Ti,Ti\. 


x  =  B(x)Un,  to  H - 

<t<  to  -I - 1 -min[Tn,Tn\. 


(2) 
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Hence  we  may  denote  a  kinetic  state  machine  as  a  seven-tuple  (U,  X,  y,  S,  B,  h,  £),  where 

U  —  C0O(]R+  x  Rp;  Rm)  is  an  input  (control)  space, 

X  —  Rra  is  the  state  space, 
y  =  Rp  is  an  output  space, 

S  C  Rfc  is  the  sensor  signal  space, 

B  is  an  Rnxm  matrix  (kinematic  constraints  matrix), 
h  :  X  — >•  y  maps  the  state  space  to  the  output  space 
and 

£  :  S  -4  {0, 1}  maps  the  sensor  output  to  the  set  {0, 1}. 

As  another  point  of  departure  from  MDL,  we  find  it  useful  to  bring  input  scaling  into 
the  picture.  This  provides  considerable  flexibility  as  in  the  examples  of  section  2.1. 

Definition:  Given  an  atom,  (U,  £,T),  define  (all,  £,  /3T),  a  £  R,  (3  £  R+  as  the  corre¬ 
sponding  scaled  atom  and  denote  it  as  (a,  (3)(U,  £,T). 

Definition:  An  alphabet  E  is  a  finite  set  of  atoms,  i.e  (U,  £,  T)  triples.  Thus  E  = 
{ {TJ i ,  £i ,  T\ ) ,  •  •  • ,  (Un,  £n,  Tn)}  for  some  finite  n  or  equivalently  E  =  {a\,  •  •  • ,  on }  where 
<jj  denotes  the  triple  (f/,,£j,Tj),  such  that  crj  ^  ( a,(3)(aj )  a  £  R,  f3  €  R+  and  i  = 
1  ,••■«,  j  = 

We  find  it  very  useful  to  formalize  scaling  into  the  language  and  hence  introduce  the 
notions  of  extended  alphabet  and  language. 

To  simplify  notation  in  the  rest  of  the  discussion  we  denote  the  scaled  atom  (l,l)cq 
simply  by  cq. 

Definition:  An  extended  alphabet  Ee  is  the  infinite  set  of  scaled  atoms,  i.e.  triples 
( aU ,  £,  /3T)  derived  from  the  alphabet  E. 

Definition:  A  language  E*  (respectively  E*)  is  defined  as  the  set  of  all  strings  over  the 
fixed  alphabet  E  (respectively  extended  alphabet  Ee). 

Definition:  A  behavior  n  is  an  element  (i.e.  word)  of  the  extended  language  E*.  For  ex¬ 
ample,  given  an  alphabet  E  =  {cq,  a2},  a  behavior  7 r  could  be  the  string  (cq,  Pi)ai(a2,  fa) 

fa)°l- 
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Remark:  To  account  for  constraints  one  might  limit  behaviors  to  lie  in  a  sublanguage 
B  C  E*.  This  will  be  explored  in  future  work. 

Definition:  The  length  of  a  behavior  denoted  by  |7r|  is  the  number  of  atoms  (or  scaled 
atoms)  in  the  behavior. 

Definition:  The  duration  T(ir)  of  a  behavior 

■  (®tl  j  )  (Un,  ,  T{ j )  •  •  •  (ofj/ ,  Pn )  (Uii ;  f ii ,  Tj^) 

executed  beginning  at  time  t0  is  the  sum  of  the  time  intervals  for  which  each  of  the  atoms 
in  the  behavior  was  executed.  That  is, 

T(  7r)  =  to  +  min[Ti,  PhTh]  + - t-  min[Tn,  faTu]  (3) 

Definition:  Given  a  kinetic  state  machine  and  a  world-model,  a  plan  T  is  defined  as 
an  ordered  sequence  of  behaviors,  which  when  executed  achieves  the  given  goal.  For 
example  a  plan  F  =  {7r3 7Ti7Tn  •  •  •}  could  be  generated  from  a  given  language  where  each 
behavior  is  executed  in  the  order  in  which  they  appear  in  the  plan.  The  length  of  a  plan 
T  is  given  by  |T|  =  £  |7r,|  and  the  duration  of  the  plan  is  given  by  T(r)  =  Y^Tfa).  In  a 
particular  context  there  may  be  more  than  one  plan  that  achieves  a  given  goal. 

Remark:  Since  each  atom  when  executed  by  a  kinetic  state  machine,  combines  in  general 
both  open  loop  and  feedback  controls,  one  could  argue  that  our  definition  of  behavior 
captures  some  aspects  of  the  essence  of  locomotion  behavior  c.f.  [15],  as  well  as  the  sense 
in  which  the  term  is  used  by  Brooks  [11],  Further  the  passage  from  atoms  to  behaviors 
to  plans  suggests  a  layered  architecture  as  we  shall  see  below. 

We  now  state  a  proposition  that  to  some  extent  answers  the  question  of  the  existence 
of  an  alphabet  E  (respectively  Ee)  which  can  be  used  to  generate  behaviors  and  hence 
plans  to  achieve  the  required  goal. 

Proposition  1  Given  an  obstacle-free  environment  and  a  kinetic  state  machine  that  is 
governed  by  the  differential  equation 

m 

x  —  Y,bi(x)uf,  x  G  1Rn,u  G  lRm  (4) 

2=1 
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such  that  the  control  Lie  algebra  (i.e.  the  vector  space  spanned  at  any  point  by  all  the  Lie 
brackets  of  the  vector  fields  bi)  has  rank  n,  then  there  exists  an  alphabet  S  (respectively 
Te)  which  can  be  used  to  generate  behaviors  (and  hence  plans )  to  steer  the  system  from 
a  given  initial  state  x0  to  a  final  state  Xf. 

Proof:  From  Chow’s  [16]  theorem  we  know  that  if  the  control  Lie  Algebra  has  rank  n 
then  the  system  is  controllable.  This  implies  there  there  exist  piecewise  constant  controls 
u  :  [0,  T]  ->  Hm,T  >  0  that  steer  the  system  from  any  initial  state  xo(0)  to  any  final 
state  Xf(T). 

A  simple  alphabet  that  can  be  used  to  generate  behaviors  consists  of  m  triples  of  the 
form  (Ui,  1, 1),  •  •  • ,  (Um,  1, 1)  a  e  R,  (3  £  R+  where 

U\  =  (1,0, 0,0,  •  ••,())' 

U2  =  (0, 1,0,0,  •  •  -  ,0)' 

Um  =  (0, 0, 0,  •  •  • ,  0, 1)'  □ 

Example  1.  Consider  the  problem  of  path  planning  with  a  unicycle,  with  a  single  sensor, 
that  wanders  around  in  a  given  environment  without  colliding  into  obstacles  (analogous 
to  the  idea  of  the  first  level  of  competence  in  Brooks  [11]  ).  Let  us  assume  the  task  of  the 
robot  (unicycle)  in  this  case  is  to  wander  till  it  senses  an  obstacle.  If  it  senses  an  obstacle 
it  avoids  the  obstacle  and  continues  to  wander  around.  We  now  formulate  and  solve  this 
problem  treating  the  unicycle  with  its  sensor  as  a  kinetic  state  machine  and  find  a  plan 
that  solves  the  problem.  The  differential  equations  governing  the  kinetic  state  machine 
are 


X 

=  Vi  cos  9 

(5) 

y 

=  V\  sin  9 

(6) 

e 

=  v2 

(7) 

where  (x,  y)  £  1R2  denotes  the  position  of  the  unicycle  w.r.t  some  inertial  frame,  6  £  Sl 
denotes  the  orientation  of  the  unicycle  relative  to  the  horizontal  axis,  Vi  and  v2,  the 
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velocity  of  the  unicycle  and  the  angular  velocity  respectively  are  the  inputs  to  the  kinetic 
state  machine.  With  reference  to  the  standard  notation  (1),  we  identify  u\  =  v\,  u2  =  v2, 
bi  =  (cos  9,  sin  6, 0)'  and  b2  =  (0, 0, 1)'. 

To  generate  the  “wander  behavior”  (wander  in  a  given  environment  without  colliding 
into  obstacles)  let  us  consider  the  following  atoms: 


ffi  =  (Ui,£i,Ti)  where 

£4  =  (1,0)' 

1  if  p  >  10 
0  if  p  <  10 
T\  G  (0,  oo) 

where  p  is  the  distance  between  the  robot  and  the  obstacle  that  is  returned  by  the  sensor. 


6  = 


02  =  (U2,&,T2)  where 


03  =  (U^faTa)  where 


U2  =  (0, 1)' 


6  -  < 


0 

1 


if  p  >  10 
if  p  <  10 


T2  G  (0,  oo) 


U3  =  (0, 1)' 
6  =  1 
T3  G  (0,  oo) 


Let  a  G  [a-mim  amax]  and  f3  G  [0,oo].  Now  consider  the  following  atomic  behaviors 


7Ti  =  K,#)(£/i,6,1) 


tt2  =  (a?,/?i)(£/2,6,l) 

7r3  =  (^,^3)(C/3,6,l) 

Based  on  the  equations  of  this  robot,  the  behavior  7Ti  is  interpreted  as  “move  forward” 
with  a  velocity  of  a\  units/sec  for  f3\  seconds,  and  behaviors  7t2  and  7 r3  can  be  interpreted 
as  “turn”  with  a  velocity  of  a\  deg/sec  for  maximum  of  f3\  seconds  (here  i  =  1,2,  3)  i.e., 
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turn  counter  clockwise  by  a  maximum  of  degrees,  unless  interrupted.  As  explained 
earlier  the  atoms  of  each  behavior  will  only  execute  as  long  as  their  respective  £  functions 
are  1  and  the  time  of  execution  is  less  than  T.  Hence,  once  7t3  begins  executing  it  continues 
until  t  =  01  since  £3  =  1  in  the  entire  interval,  [0, /?)*]. 


Figure  1:  Trajectory  and  Inputs  Generated  by  the  Plan  Tx 
Consider  the  plan  I\  =  ((5, 100)7Ti( — 1, 90)7r3)*  i.e. 

Ti  =  {(a},/^1)^  (af,$)7r3  (alfl^TT!  (af,$)jr3  •  •  •} 

Observe  that,  if  this  plan  is  executed  in  the  environment  (with  walls  W1  and  W2)  as 
shown  in  the  Fig.  1,  the  robot  will  move  forward  for  time  t,  t0  <  t  <  t0  + 100,  where  to  is 
the  time  at  which  the  behavior  was  started,  when  £1  will  interrupt  it  (too  close  to  Wl). 
Let  us  assume  that  the  interrupt  was  received  at  to  +  T\.  The  execution  of  behavior  7Ti  is 
then  inhibited,  behavior  7t3  is  picked  up  from  the  queue  and  is  executed.  As  £3  =  1  in  the 
entire  interval  t  €  [to  +  T\ ,  to  +  Ti  +  90)  the  robot  will  then  turn  clockwise  by  90  degrees 
and  then  it  will  move  forward  (execute  behavior  7Ti).  But  again  after  some  finite  time 
wall  W2  (see  Fig.  1)  will  cause  £1  =  0  and  hence  interrupt  the  move  forward  behavior. 
Behavior  7 r3  is  executed  as  earlier  i.e.  the  robot  turns  clockwise  by  90  degrees,  and  now 
continues  to  move  forward.  If  it  does  not  detect  an  obstacle  at  the  end  of  100  seconds 
since  it  began  moving  forward,  it  will  stop,  turn  clockwise  by  90  degrees,  and  continue 
to  repeat  the  sequence  of  actions. 
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Now  consider  the  plan  T2  =  ((50,  2)7Ti(— 20,  5)7t2)*.  Observe  that,  if  this  plan  is 
executed  in  the  same  environment  (see  Fig.  2),  then  while  executing  the  “move  forward” 
i.e.  7Ti,  in  the  time  interval  t0  <  t  <  t0  +  2  the  robot  realizes  that  the  obstacle  is  at  a 
distance  less  than  10  units  from  it  and  hence  interrupts  the  “move  forward  ”  and  the 
robot  begins  to  execute  “turn  right” .  Due  to  the  choice  of  the  interrupt  function  £2  the 
robot  will  now  switch  between  “turn  right”  and  and  “move  forward”  (a  condition  reffered 
to  as  chattering)  and  trace  a  trajectory  as  shown  in  the  figure.  Hence  depending  on  the 
choice  of  the  alphabet  one  can  generate  different  plans  to  achieve  the  same  task. 


Figure  2:  Trajectory  and  Inputs  Generated  by  the  Plan  T2 

The  question  of  how  to  generate  a  plan  given  an  alphabet  and  a  kinetic  state  machine, 
is  an  open  one  and  it  largely  depends  on  the  task  and  the  planner.  In  section  3  we  describe 
a  path  planner  for  nonholonomic  robots.  Before  we  discuss  the  features  of  the  planner  we 
introduce  some  more  definitions  that  help  formalize  measures  to  evaluate  the  performance 
of  a  plan. 

2.1  Performance  Measure  of  a  Plan 

At  first,  it  appears  that,  to  generate  a  plan  to  steer  a  system  from  a  given  initial  state 
x0  to  a  final  state  Xf  requires  complete  a  priori  information  of  the  world,  which  is  not 
available  in  many  instances  of  path  planning.  In  the  absence  of  such  complete  a  priori 
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information  about  the  world  W,  then  the  planning  system  has  to  generate  a  sequence  of 
plans  based  on  the  limited  information  about  W  that  it  has,  which  when  concatenated 
will  achieve  the  required  goal.  We  refer  to  these  plans  that  are  generated  on  limited 
information  to  achieve  some  subgoal  as  partial  plans  F5.  The  plan  to  steer  the  system 
from  a  given  initial  state  x0  to  a  final  state  Xf  is  then  determined  after  the  system  has 
reached  the  final  state  and  is:  T  =  Ti  T2  •  •  •  Tn  where  Tj  is  the  partial  plan  consisting 
of  only  those  behaviors  and  atoms  in  each  behavior  that  have  been  executed  for  t  >  0. 

Remark:  As  a  partial  plan  is  generated  with  limited  information  of  the  world,  not  all 
the  behaviors  and  not  every  atom  in  a  behavior  generated  by  the  partial  plan  may  be 
executed  at  run  time  for  the  following  reasons: 

(i)  Let  us  consider  a  behavior  7q  =  <t3<7i<74  •  •  -on.  Let  us  assume  that  the  atom  er3  is 
interrupted  by  £3  at  t.  Now  as  explained  earlier  a1  will  begin  to  execute.  But  if  £3  —  £1 
the  atom  cr\  will  not  be  executed  and  depending  on  £4,  cr4  will  begin  to  execute. 

(ii)  For  practical  reasons  we  introduce  a  hierarchy  of  interrupts.  While  a  specific  behavior 
7 n  of  partial  plan  is  being  executed,  if  a  level  0  interrupt  is  received,  the  execution  of 
that  particular  atom  is  inhibited  and  the  next  atom  in  that  behavior  is  executed  just 
as  explained  in  (i)  above.  If  a  level  1  interrupt  is  received  while  a  behavior  is  being 
executed  the  execution  of  that  behavior  is  now  inhibited  and  the  next  behavior  in  the 
partial  plan  is  executed.  Finally  if  a  level  2  interrupt  is  received  the  execution  of  the 
remainder  of  the  current  partial  plan  is  stopped  and  a  new  partial  plan  is  executed. 

The  length  of  a  plan  is  given  by  |r|  =  £"=1  \^i\  and  the  time  of  execution  of  the  plan  is 
given  by  T(T)  =  £"=1  Tfa). 

With  these  formal  definitions,  we  can  start  discussing  the  performance  of  an  algorithm 
that  uses  these  behaviors  and  analyzing  some  earlier  algorithms  for  nonholonomic  motion 
planning. 

Given  an  algorithm  that  generates  a  plan  T  we  define  a  candidate  measure  of  perfor¬ 
mance  0(r)  of  the  plan  as 

e(r)  =  r(r)  +  r|r|  (8) 
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where  r  is  a  normalizing  factor  having  the  units  of  time.  (One  need  not  limit  oneself  to 
such  additive  combinations  although  this  is  the  only  case  used  here.) 

Defining  a  performance  measure  for  a  path  planner  is  a  difficult  task  as  it  is  dependent 
on  the  goal  the  robot  seeks  to  achieve.  Some  path  planners  use  the  total  time  to  achieve 
the  goal  as  a  measure  of  performance.  In  many  situations  one  might  be  interested  in  not 
only  the  time  but  also  on  the  smoothness  of  the  path  traversed  or  the  number  of  times 
switching  between  different  controls  was  necessary.  For  example  consider  the  task  of 
parallel  parking  of  a  car.  One  might  be  able  to  achieve  the  goal  by  using  only  open-loop 
controls  but  switching  between  them  at  regular  intervals,  hence  possibly  reducing  the 
time  to  achieve  the  goal  but  compromising  on  the  smoothness  of  the  path.  On  the  other 
hand  if  one  uses  a  time  dependent  feedback  law,  the  same  task  could  be  achieved,  possibly 
by  moving  along  a  smooth  trajectory  at  the  risk  of  taking  longer  time  to  achieve  the  goal. 
This  indicates  a  trade-off  between  two  competing  requirements  which  is  captured  by  the 
performance  measure  (8). 

We  now  define  the  optimal  performance  of  a  plan  as 

®(r)optimai  =  min{T{T)  +  t\T\}.  (9) 

Here  the  minimization  is  performed  over  the  subset  of  plans  generated  by  the  subset  B 
of  admissible  behaviors.  Depending  on  the  kinetic  state  machine  and  the  choice  of  the 
planner  one  can  now  place  bounds  on  the  optimal  performance  and  hence  compare  the 
performance  of  different  planners  given  the  same  language  or  that  of  the  planner  given  a 
new  language.  This  is  illustrated  in  the  example  given  below. 

Example  2:  Consider  the  problem  of  steering  the  unicycle  from  a  given  initial  location 
z0  to  Zf.  The  equations  of  the  unicycle  are  given  in  example  1.  Let  us  assume  that 
the  language  is  based  on  the  following  atoms.  cq  —  (C/i,£i,l),  o2  =  {U2,^ 2,1)  where 
Ui,  £i,  U2,  £2  are  as  defined  in  example  1.  Let  a  G  [—5,  +5]  and  (3  G  (0, 00). 

Let  us  also  assume  that  the  planner  did  not  have  complete  information  about  the 
world  and  had  to  generate  n  partial  plans  to  achieve  the  goal.  Each  partial  plan  consists 
of  steering  the  unicycle  from  Zi  to  Zj  (see  Fig  3)  such  that  there  are  no  obstacles  in  some 
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small  neighborhood  of  the  line  segment  joining  these  two  locations.  Let  us  make  a  further 
assumption  that  the  planner  uses  6  [1,  —1]  as  the  scaling  factor  while  generating  partial 
plans. 


Figure  3:  Partial  Plan  Generation 

From  the  kinematic  equations  of  the  unicycle  we  know  that  a  simple  partial  plan  to 
steer  a  unicycle  from  Zi  =  ( Xi,yi,6i )  to  Zj  =  (xj,yj,9j)  would  be  : 

(i)  turn  by  (9ZiZj  -  9t), 

(ii)  move  by  a  distance  di  and 
(hi)  finally  turn  by  (93  -  9ZlZ]), 

where  ZiZj  is  the  vector  in  1R2  joining  { x^yi )  and  ( Xj,yj ),  di  —  ||^^j|2  and  9ZiZj  is  the 
orientation  of  the  vector  w.r.t.  to  the  x-axis. 

We  can  rewrite  this  simple  algorithm  as  a  partial  plan  derived  from  the  language 
using  the  atomic  behaviors  7Ti  =  <7i  and  ir2  =  cr2, 

rPj  =  {{9n/\9n\,  iX,di)ai  (9i2/\9i2\,  \9i2\)o-2} 

where  9n  and  9i2  are  the  angles  of  the  two  turns  as  described  above  of  the  ith  partial 
plan.  Hence  the  plan  to  steer  the  system  from  za  to  Zf  is  given  by 

r  =  {rp1rp2---rpn} 
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Given  a  plan  we  now  illustrate  how  bounds  can  be  placed  on  the  optimal  performance 
based  on  the  knowledge  of  the  kinetic  state  machine  and  the  language.  Let  dmax  — 
max  HzjZjll. 


Tmax(^P  i)  5 
< 


27T  ~\~  dynax  2n 
47 r  dm(1x- 


and 

ir'ilmo.  <  3 

Hence, 

0  <  0(r)  <3 n  +  n(47r  4-  dmax). 

However,  as  we  are  using  only  open-loop  controls,  we  know  from  the  kinematics  of  the 
system  that  given  an  initial  state  x0  and  a  final  state  Xf  both  the  behaviors 
and  (koti,  0i/k)oi  would  steer  the  kinetic  state  machine  from  the  initial  state  to  the  final 
state.  Hence  we  could  replace  ( ai,(3i)cri  by  (kai}  fiifk)oi. 

Observe  that  in  the  generation  of  the  above  partial  plan  and  in  the  calculation  of  the 
performance  measure  we  restricted  a*  to  {—1,1}  (in  some  sense  placed  bounds  on  the 
velocity  of  the  unicycle)  because  the  planner  did  not  have  complete  information  about 
the  world.  But  since  the  language  permits  a,  E  [—5, 5],  we  have 

0  <  e(r)0*M  <  ± <W)  +  3n. 


Having  placed  bounds  on  a  plan  generated  by  one  set  of  behaviors  we  can  now  compare 
the  performance  of  another  set  of  behaviors  (for  example,  one  using  periodic  functions 
to  steer  the  robot)  against  these  bounds. 

In  the  above  examples  we  have  used  very  simple  controls  in  our  alphabet.  But  one 
should  note  that  depending  on  the  application,  a  wide  variety  of  controls  (open  loop  and 
closed  loop)  could  be  included  in  the  alphabet  and  some  examples  of  such  controls  can 
be  found  in  [1,  2,  3,  7,  8,  9,  17] 
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The  question  of  how  to  generate  a  plan  given  an  alphabet  and  a  kinetic  state  machine, 
is  an  open  one  and  it  largely  depends  on  the  task.  In  the  next  section  we  describe  a  control 
architecture  and  a  few  details  of  a  path  planner  for  nonholonomic  robots. 

3  Hybrid  Architecture 

As  we  seek  to  attain  higher  levels  of  autonomy  in  robots,  the  need  for  hierarchical  and 
distributed  control  schemes  becomes  apparent.  Motivated  in  part  by  the  hierarchical 
structure  of  neuromuscular  control  system  we  present  a  control  architecture  (see  Fig.  4), 
to  generate  and  execute  plans  to  achieve  a  given  task.  (To  avoid  clutter,  only  one  level 
of  interrupts  is  given.)  The  lowest  level  is  the  kinetic  state  machine  with  sensors,  where 
the  sensors  are  used  in  a  low-level  feedback  loop.  The  planner  could  be  interpreted  as 
the  higher  end  of  the  system  where  sensory  information  has  been  processed  to  generate 
goal-related  trajectory  information.  The  layered  and  distributed  nature  of  the  control 
becomes  apparent  when  one  observes  that  once  a  plan  has  been  generated  each  level 
and  even  various  modules  at  the  same  level  (e.g.  cleanup  and  plan)  continue  to  execute 
independently. 

The  task  of  the  planner  is  to  use  the  limited-range  sensor  information,  to  generate 
partial  plans  that  result  in  collision-free  feasible  trajectories.  Planning  is  done  at  two  lev¬ 
els  -  global  and  local.  For  local  planning,  obstacle  free  (non)feasible  paths  are  generated 
using  potential  functions  assuming  that  the  robot  is  holonomic.  A  partial  plan  (feasible 
path)  is  then  generated  that  obeys  the  constraints  in  the  configuration  variables.  As 
feasible  trajectories  are  only  approximations  to  the  trajectories  generated  using  potential 
functions,  collision  with  obstacles  could  occur  while  tracing  them.  While  the  robot  is 
in  motion,  collisions  are  avoided  by  using  the  sensor  information  to  trigger  interrupts  as 
described  previously. 

At  a  global  level,  heuristics  along  with  the  world  map  generated  while  the  robot  is 
en  route  to  the  goal  are  used  to  solve  the  problem  of  cycles. 

Once  a  partial  plan  has  been  generated  it  is  executed  as  explained  in  section  2.  Let  us 
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assume  that  an  atom  ([/,,  £*,  Ti)  is  executed  at  time  t  —  to-  In  Fig.  4,  T  is  a  timer  whose 
output  is  1  (active  high)  while  t0  <  t  <  Ti  and  is  0  (active  low)  if  t  >  t0  +  T*.  &(s(t)) 
returns  an  interrupt  (active  low)  to  the  system  when  conditions  defined  by  £i{s(t))  are 
satisfied.  Observe  here  that  the  interrupt  could  be  of  level  2,1  or  0.  Hence  the  functioning 
of  the  AND  gates  in  the  kinetic  state  machine  can  be  interpreted  as  follows  -  if  either  the 
robot  receives  an  interrupt  or  t  >  t0  +  T  the  input  to  gate  II  is  an  active  low  and  hence 
the  input  to  the  kinetic  state  machine  is  inhibited  i.e.  the  current  atom/behavior/partial 
plan  (depending  on  the  interrupt  level)  is  stopped  and  the  next  atom/behavior /partial 
plan  in  the  queue  is  executed  (see  Remark  (ii)  in  section  2.1). 

Fig.  5  shows  a  simulation  of  the  path  generated  by  the  planner.  As  the  partial  plans 
are  generated  based  on  local  information  (denoted  by  obstacle  free  disks  in  Fig.  5)1, 
the  paths  initially  generated  are  locally  optimal.  As  the  knowledge  of  the  environment 
increases,  the  performance  of  the  system  improves.  Fig  6  shows  an  example  of  a  plan 
generated  before  and  after  partial  knowledge  of  the  world  has  been  gained.  The  bold 
solid  lines  denote  the  new  trajectories  (partial  plans).  Observe  that  the  length  of  the  plan 
has  decreased  by  less  than  a  third.  More  details  on  the  planner  can  be  found  in  [18]. 


4  Final  Remarks 

The  motion  description  language  along  with  the  control  architecture  serves  as  an  ab¬ 
straction  between  continuous  and  discrete  time  control  strategies.  Current  directions 
of  research  include  continuing  formalization  of  behavior-based  robotics.  This  includes 
expanding  the  alphabet  to  include  multiple  kinetic  state  machines  and  communication 
protocols  between  these  machines.  We  are  also  working  on  extending  the  technique  used 
by  the  planner  to  generate  plans  in  the  presence  of  moving  obstacles. 

1It  should  be  pointed  out  here  that  the  obstacle- free  disks  generated  by  the  planner  are  not  entirely 
obstacle  free,  but  this  is  because  in  the  simulator  we  have  used  only  sensors  of  the  ‘eye’  to  generate 
obstacle-free  disks.  For  now,  those  obstacles  that  are  not  detected  by  the  sensors  are  treated  as  being 
in  the  blind  spots  of  the  robot. 
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Figure  4:  Hybrid  Control  Architecture 
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Obstacles 


(^j  Obstacle  free  disks 


Figure  5:  Paths  Generated  by  the  Planner 
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Figure  6:  Paths  Generated  by  the  Planner 
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